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In memory of Konstantin Alexandrovich Sevast'yanov, 1956-1984- 

Abstract 

We prove that any pair of bivariate trinomials has at most 5 isolated roots in the positive 
quadrant. The best previous upper bounds independent of the polynomial degrees were much 
larger, e.g., 248832 (for just the non-degenerate roots) via a famous general result of Khovanski. 
Our bound is sharp, allows real exponents, allows degeneracies, and extends to certain systems 
of n-variate fewnomials, giving improvements over earlier bounds by a factor exponential in the 
number of monomials. We also derive analogous sharpened bounds on the number of connected 
components of the real zero set of a single n-variate m-nomial. 
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1 Introduction 

Generalizing Descartes' Rule of Signs to multivariate systems of polynomial equations has proven to 
be a significant challenge. Recall that a weak version of this famous classical result asserts that any 
real univariate polynomial with exactly m monomial terms has at most m — 1 positive roots. This 
bound is sharp and generalizes easily to real exponents (cf. Section ||) . The original statement in Rene 
Descartes' La Geometrie goes back to June of 1637, and Latham and Smith's English translation 
states that this result was observed even earlier by Thomas Harriot in his Artis Analyticae Praxis 
(London, 1631) JgL54[ Footnote 196, Pg. 160]. Proofs can be traced back to work of Gauss around 
1828 and other authors earlier, but a definitive sharp bound for multivariate polynomial systems 
seems to have elluded us in the second millenium. This is particularly unfortunate since systems of 
sparse polynomial equations, and inequalities, now occur in applications as diverse as radar imaging 
JFH95| , chemistry pH99fl , and neural net learning ]VR02| . 

Here we take another step toward a sharp, higher-dimensional generalization of Descartes' bound 
by providing the first significant improvement on the case of curves in the plane, and certain higher- 
dimensional cases, since Khovanski's seminal work in the early 1980's [ Kho80| . Khovanski's rev- 
olutionary Theory of Fewnomials [Kho8G\ Kho91 extends Descartes' bound to a broader class 



of analytic functions (incorporating certain measures of "input complexity" ) as well as higher di- 
mensions, but the resulting bounds are impractically large even in the case of two variables. Our 
bounds are sharper than Khovanski's by a factor exponential in the number of monomial terms, 
allow degeneracies, and are optimal for the case of two bivariate trinomials. We then present similar 
sharpenings, also allowing real exponents and degeneracies, for the number of compact and non- 
compact connected components of the real zero set of a single sparse polynomial with any number 
of variables. 
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1.1 Main Results 



Perhaps the simplest way to generalize the setting of Descartes' Rule to higher dimensions and real 
exponents is the following: 

Notation. For any c G R* := R\ {0} and a = (ai, a n ) 6l", let x a := x^ 1 ■ ■ ■ and call cx a a 
monomial term. We will refer to R™ := {x <= R™ | x$ > /or a// i} as i/ie positive orthant. 
Henceforth, we will assume that F := (/i, . . . , /&) where, for all i, fa G R[x a | a S R™] and /j /ias 
exactly nii monomial terms. We call fi an n-variate mj-nomial and, when mi, . . . , rrik >1, we call 
F a k x n fewnomial system (over RJ of type (mi, . . . , m*,). We call any homeomorphic image 
of the unit circle or a (closed, open, or half-open) interval an arc. Finally, we say a real root ( of 
F is isolated (resp. smooth, non-degenerate, or non-singular ) iff the only arc of real roots of 
F containing Q is C itself (resp. the Jacobian of F, evaluated at has full rank), o 

Definition 1. For any mi, . . . , m„ gN, let A/"' (mi, . . . , m„) (resp. Af{m\, . . . , m„) ) denote the maxi- 
mum number of non- degenerate (resp. isolated) roots an nxn fewnomial system of type (mi,...,m n ) 
can have in the positive orthant. o 

Finding a tight upper bound on Af(mi, . . . ,m n ) for n > 2 remains a central problem in real 
algebraic geometry which is sti ll poorl y understood. For example, Anatoly Georievich Kushnirenko 



conjectured in the mid-1970's [ Kho80 that Af'(m±, . . . ,m„) = J|™ =1 (mi — 1), at least for the case of 



integral exponents. The nxn polynomial system 




-l 



<),..., H (Xn-i)] (1) 



=1 



was already known to provide an easy lower bound of Af' (mi, . . . , m„) > ri"=i( TO « — 1)> a l mos t 



30 years would pass until a counter-example to Kushnirenko's conjecture was published |Haa02| (see 
Formula (^|) in Section 1.2). However, it was known much earlier that Kushnirenko's conjectured 



upper bound could not be extended to J\f{m\, . . . , m n ): The trivariate polynomial system 
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x{z - l),y(z - 1), JIG* - *) 2 + 11^ - ? ) 2 ( 2 ) 



=1 



is of type (2,2,21), has exactly 25 (> 20 = 1 • 1 • 20) roots in the positive octant, all of which are 
isolated and integral, but with Jacobian of rank < 3 (see, e.g., [ Stu98| , note added in proof] or 



[Ful84, Ex. 13.6, Pg. 239]). Indeed, Af'(mi, . . . , m n ) < J\f(mi, . . . , m n ) for all mi, ... , m n , since a 
non-degenerate non-isolated root in R" can never have more than n—1 tangent planes with linearly 
independent normal vectors. Cases where the inequality A/"' (mi, . . . , m n ) <A/"(mi, . . . , m n ) is strict 
appear to be unknown. 

Interestingly, allowing degeneracies and real exponents introduces more flexibility than trouble 
in our approach: The proof of our first main result is surprisingly elementary, using little more than 
exponential coordinates and an extension of Rolle's Theorem from calculus. 

Definition 2. For any S'CR", let Conv(S') denote the smallest convex set containing S. Also, for 
any m-nomial of the form f{x):—^2 aeA c a x a , we call Supp(/) := {a | c a ^0} the support off, and 
define Newt(/) := Conv(Supp(/)) to be the Newton polytope of f. Finally, we let Z + (F) denote 
the zero set of F in R™ . o 

Theorem 1. We have AT' (3, 3)= N (3, 3) = 5 and, more generally: 

(a) A/"'(3,m)=A/"(3,m)<2 m - 2 for all m>4. 

(b) Any nxn fewnomial system F:= (/i, . . . , /„) of type (mi, . . . , m n _i, m) with 

(i) bi + Supp(/i), . . . , 6„_i + Supp(/„_i) C A for some b u ..., b„_i € R" and A C R" of 
cardinality n + 1 
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(ii) Z+(fi, . . . , f n -x) smooth 

has no more than n + n 2 + • • • + n™ 1 ^ 1 isolated roots in R™ , for all m,n>l. Also, for such F, 
the maximum number of non- degenerate and isolated roots in K™ are equal. 

(c) For any ot\, a^, as, 0,2, &2> C3, £^3, r%, Si, U2, i>2 €K and any degree D polynomial p£M\S\, S2] with 
Z+(p) smooth, the 2x2 fewnomial system 



to 



Qi + a2X a2 y b2 + a^x C3 y d3 
p(x ri y Sl ,x U2 y V2 ) 



has no more than 4Area(Newt(p)) + 2Z? + l (< 6D + 1 ) isolated roots in R^_, where we normalize 
area so that the unit square has area 2. 

The quantities M' {mi, . . . , m n ) and jV(mi, . . . , m n ) are much easier to compute when some m, is 
bounded above by 2: all families of polynomial systems currently known to admit explicit formulae, 
including n x n binomial systems, are summarized in Theorem [| of Section ^. 

Remark 1. The value o/A/"(3, 3) was previously unknown and there appears to be no earlier result 
directly implying the equality J\f'(3,m) = A/"(3,m) for any m. In particular, the only other upper 
bound on A/"'(3,to) or Af(3, m) until now was Af'{3, m) < 3™+ 2 2( m + 2 )( m+1 )/ 2 ; which evaluates to 
248832 when m = 3. The best previous bound for the systems described in (b) and (c) were respec- 
tively ( n+ l)™+«2( m+ ")( m+ ™- 1 )/ 2 and 1024L>(D + 2f , counting only the non-degenerate roots. (See 
Khovanski's Theorem on Real Fewnomials in Section 1 1. Jj and Proposition [I] of Section ^.j o 

Example 1. Note that while we still don't know an upper bound on A/ 7 (4, 4) better than Khovanski's 
4586471424, we at least obtain a new approach for certain fewnomial systems with many monomial 
terms. For instance, part (c) of our first main theorem tells us that the 2x2 fewnomial system: 

a l +a 2 x a2 y b2 +a 3 x C3 y d3 
0o + P-ix ri y Sl + (3 lX ri+U2 y Sl+V2 + p 2X ^+^) y 2(s 1+ v 2 ) + . . . + / 3 looX ioo(r 1+tl2 yoo( Sl+1 , 2 ) 

has no more than 801 = 4- 100 + 2-200 + 1 isolated roots in K + , for all aj, a^, hi, C3, efe, r\, s\, 112, V2 €M 
and Pi € M such that (3q + f3\u + • • • + /3ioo^ 100 has no degenerate roots. The pair of fewnomials 
^(y — l)(y — 2), y — Y[]^i( x ~ i)J easily shows us that a system in this family can have as many as 
200 non-degenerate roots, if not more. Khovanski's Theorem on Real Fewnomials below yields an 
upper bound of 68878994643353600 for just the number of non-degenerate roots, o 

We can also classify when a pair of bivariate trinomials has 5 isolated roots in the positive 
quadrant via Newton polygons. In particular, note that while one can naturally associate a pair of 
polygons to F when n — 2, we can also associate a single polygon by forming the Minkowski sum 
Pp :=Newt(/i) + Newt(/2)- We can then give the following addendum to Theorem |l|. 

Corollary 1. i 2 x 2 fewnomial system F of type (3,3) respectively has at most 0, 2, or 4 isolated 
roots in according as we restrict to those F with Pf a line segment, triangle, or i-gon with 
^€{4,5}. 

The central observation that led to Theorem |l| may be of independent interest. We state it as 
assertion (5) of Theorem ^ below. However, let us first define two more combinatorial quantities 
closely related to N' and Af. 

Definition 3. For any fi, n £ N, we say that a k x n fewnomial system with exactly fi distinct 
exponent vectors is /i-sparse. Also, let K,' (n, /i) (resp. /C(n,/i)j denote the maximum number of 
non- degenerate (resp. isolated) roots a pi-sparse n x n fewnomial system can have in the positive 
orthant. o 

Assertions (2) and (3) of our next main result dramatically refine the bounds of Oleinik, Petro- 
vsky, Milnor, Thorn, and Basu on the number of connected components of a real algebraic set 
[OP4E , Mil64, Tho65, |Bas99| in the special case of a single polynomial, and hold in the more general 



context of real exponents: 
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Theorem 2. Let f be any n-variate m-nomial and let P(n, to) denote be the maximum num- 
ber of connected components of Z + (f) over all n-variate m-nomials. Also let P CO mp(n,m) (resp. 
Pnon(n,m)) be the corresponding quantity counting just the compact (resp. non-compact) connected 
components. Finally, for any n, sx,v,2,V2 G R and any degree D polynomial p G M[5i, S2], let 
p{x, y) :=p(x ri y Sl , x U2 y V2 ). Then we have: 

0- P comp (n+l,2) = ) P non (n + l,2) = l, P comp (l, to) =m - 1, P non (l,m)=0, and P non (n, 0) = 1 
for all m, n> 1. 

1- Pcomp{n^ to) =0 and P nolL (n,m)=P non (m — 1,to)<P(to — 2, to) for 3<m<n + 1. 

2. max{[fj -n-l, (L^J - l)"}<Pco rop (n,m)<2L/C / (n,m)/2j /or all n>2, and the last 
multiple of 2 can be removed in the smooth case. Also, Z + (p) has no more than Area(Newt(p)) 
compact components, where we normalize area so that the unit square has area 2. 

3. max /to — 1, ([ 2 "A-1) J ~ l Y J | <Pnon(n,m)<2P(n- 1, to) for all n> 2. Also, Z+(p) has 
no more than 2D non- compact connected components. 

Furthermore, in the special case where n = 2 and Z+(f) is smooth, let /(to) (resp. V(m)) denote 
maximum number of isolate^ inflection points (resp. isolated points of vertical tangency) of Z + {f). 
Then we also have 

4- V{m) </C(2,to) for all m> 1, and Z + {p) has no more than Area(Newt(p)) isolated points of 
vertical tangency. 

5. I(m) < 3/C'(2, to) for all to < 3, and Z + {p) has no more than 3Area(Newt(p)) isolated inflection 
points. 

In particular, V(3)<1 and /(3)<3, even if Z + {f) is not smooth. 

Note that a non-compact connected component of Z + (f) can still have compact closure, since R™ 
is not closed in R n , e.g., {(xi, x%) 6R+ | x\ + x\ = l}. 

While the above bounds on the number of connected components are non-explicit, they are 
stated so they can immediately incorporate any advance in computing K!(n,p). So for a general 
and explicit upper bound independent of the underlying polynomial degrees now, one could, for 
instance, simply insert the explicit upper bound for /C'(n,/i) appearing in Khovanski's Theorem on 
Real Fewnomials (see Section 1.2 below) and the formula /C'(2, 4) = 5 implied by Theorem [l] (see 
Proposition [t] of Section ||). 

Corollary 2. Following the notation of Theorem ^, P(n,m) < K!(n,m) + 2P{n — 1,to) for all 
n>2. More explicitly, P(n, to) < Y%=o 2*/C(n - i, to) < n(n + i)™2 n - 1 2 m ( m - 1 ^ 2 . In particular, a 
tetranomial curve in "Mr, has no more than 4 compact connected components and no more thar^ 4 
non- compact connected components. I 

The bound above is already significantly sharper than an earlier bound of (2n 2 — ?i+l) m (2n) I1_1 2 m ( m_1 ^ 2 , 
which held only for the smooth case, following from [Kho91, Sec. 3.14, Cor. 5]. The bounds of Thco- 
rcm|^ also improve an earlier result of the middle author on smooth algebraic hypersurfaces |Roj00a, 
Cor. 3.1]. 

Our final main result shows us that we can considerably refine assertion (3) of Theorem ^ if we 
take advantage of the underlying polyhedral structure. 

Definition 4. For any w:=(w±, . . . ,w„)eK" and any compact set Bet", we let B w — the face 
of B with inner normal w — be the set of all x€B minimizing the inner product w ■ x. Finally, 
for any any n-variate m-nomial f of the form ^2 aeA c a x a , we let Init w (/) — the initial terms of 
f (with respect to w) — be J2a£A™ c aX a ■ 

relative to the locus of points of inflection (resp. points of vertical tangency)... 
2 Theorem actually yields an upper bound of 6 for the number of non-compact components so we cheated slightly 
by using Theorem ^ below to get the very last bound. 
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Recall that the dimension of a polytope P C W 1 is the dimension of the smallest subspace containing 
a translate of P and that a facet of an n-dimensional polytope is simply a face of dimension n — 1. 

Theorem 3. Lei / 6e any n-variate m-nomial f with n- dimensional Newton polytope. Assume 
further that Z + (luit w (f)) is smooth for all w£l"\{0}. Then the number of non- compact connected 
components of Z + (f) is no more than 

w a unit inner facet 
normal of Newt(/) 

where N w denotes the number of the number of connected components o/Z + (Init u ,(/)). In particular, 
this bound is no larger than ^ P(n— 1, #Supp(/)nQ). Finally, when n — 2 and Z+(f) is smooth 

Q a facet of P 

as well, the last upper bound is sharp and can be simplified to \_m' /2\ , where m! is the number of 
points o/Supp(/) lying on the boundary o/Newt(/). 

Note that Z + (f) need not be smooth and our bound above is completely independent of the number 

of exponent vectors lying in the interior of Newt(/). The bivariate example f(x, y) =y— Y\aLi 2 { x ~~ *) 
easily shows that the very last bound is sharp. A more intricate trivariate example follows. 

Example 2. Taking n = 3, suppose f is 

K 

at + a 2 x 3a + a 3 z 3c + a 4 x 3a z 3c + f3 1 x a y b z c + (3 2 x 2a y b z c + (3 3 x a y b z 2c + f3 4 x 2c y b z 2c + ^ llX a 'y b 'z c ', 

i=l 

where K is any positive integer, the otj, (3j, 7j are any nonzero real constants, a, 6,c>0 and, for all 
i, a < Oi < 2a, c<Ci < 2c, and < bi < b. Note that Newt(/) is a snub pyramid with a rectangular 
base and thus has the same face lattice as a cube. Note also that no exponent vector of f lies in the 
relative interior of any face of Newt(/) of dimension 1 or 2. It is then easily checked that 

(aiaj - a 2 a 3 a i )(j3il3l - PifoPijQiot, 0) ^ 0, 

where Q is a product of 4 more complicated polynomials, is a sufficient condition for all the Z + (Init-u, (/)) 
to be smooth. So, under the last assumption, Theorem tells us that the zero set of f in the positive 
octant has no more than 6P(2,4) < 6 • (4 + 6) = 60 non-compact connected components, employing 
corollary and the obvious fact that P(n,m) < fcomp(^) m ) + -Pnon(n, m) for the first inequality. 
Note that Theorem^ would have given us a less explicit upper bound o/P non (3,8 + K) which, by 
assertion (3), exceeds 60 for all K>?>7 (if not earlier), o 

Note also that the assumption on the \mt w (f) is rather mild: it follows easily from Sard's 



Theorem | Hir94 | that our smoothness condition will hold for a generic choice of the coefficients of 
/, e.g., all coefficient vectors outside a set of measure zero in C* Supp ^- ) depending only on Supp(/). 
In particular, this hypothesis can become vacuous depending on the underlying Newton polytope. 

Corollary 3. Following the notation above, assume that Newt(/) is simplicial (i.e., for all d< 
dimNcwt(/) every d- dimensional face o/Newt(/) has exactly d+1 vertices) and that [the relative 
interior of a face Q o/Newt(/) contains a point o/Supp(/) =>■ Q is a vertex]. Then Z + (Initu,(/)) 
is smooth for all wel"\{0}. ■ 

Corollary U follows easily from the fact that for such an /, and any vector w£l"\{0}, Z + (lmt w (f)) 
is analytically diffeomorphic to R™ _1 . The latter fact in turn follows easily via a monomial change 
of variables (cf. Proposition || of Section |^) . 

1.2 Important Related Results 



The only available results for bounding the number of real roots, other than those coming from 
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Fewnomial Theory [BC76, Gri82, Ris85, Kho91, Zel9E, RojOOa , depend strongly on the individual 
exponents of F and are actually geared more toward counting complex roots, e.g., [ BKK76] , Kaz81, 
BLR91, Roj9£]. (We also note that while the bounds of Zel99| generalize Khovanski's theory to 
solution sets of inequalities involving Pfaffian functions, they do not appear to yield any new 
bounds on the quantities JC and Af we study.) So proving just 7V(3, 3) < oo already suggests an 
analytic approach. Nevertheless, the bounds from [ BKK76] , Kaz81, |BLR91 , Roj99] can be quite 
practical when the exponents are integral and the degrees of the polynomials are small. 

Let us also point out that the term "fewnomial" is due to Kushnirenko and that the first ex- 
plicit bounds in Fewnomial Theory were derived (not yet in c omplete generality) by Konstantin 
Alexandrovich Sevast'yanov i n unpublished work around 1979 [ Kho02 ] . Dima Yu. Grigoriev and 
Askold Georgevich Khovanski have also pointed out that shortly after Kushnirenko formulated his 
conjecture, a simple counter-example with n = 2 was found by a student at Moscow State Univer- 
sity GriOO , Kho02 |. Unfortunately, while the counter-example was verified by Khovanski himself 
[ Kho02 1 , it does not seem to have been recorded and the name of its inventor (who left mathematics 
immediately after graduating) seems to have been forgotten. 

As for the size of upper bounds on the number of real roots, it is interesting to note that the 
best current general bounds independent of the polynomial degrees are exponential in the number of 
monomial terms of F, even for fixed n. Observe one of the masterpieces of real algebraic geometry. 



Khovanski's Theorem on Real Fewnomials . (See , Kho8L / and iKho91, Cor. 6, Pg. 80, Sec. 



3.12].) We have lC'(n,Li)<(n + l) f *2^~ 1 ^ 2 . More generally, the n X n fewnomial system 

Qi{x) = ■■■ = q n (x) = 0, 



where each qj is a polynomial of degree Di in x\ , . . . , x n and x ai , . . . , x a ^ for some aj., . . . , € K.™, 
has no more than 2 /i ( /i_1 )/ 2 (l + Y^h=i ^i)** Y\7=i non-degenerate roots in K™ . ■ 

Finding non-trivial lower bounds on even /C'(2, /i) seems quite hard and surprisingly little else is 
known about what an optimal version of Khovanski's Theorem on Real Fewnomials should resemble. 
For example, around 1996, Ilya Itenberg and Marie-Frangoise Roy propos ed a c onjectural polyhedral 
generalization of Descartes' Rule to multivariate systems of equa tions |j[R96 |, based on a famous 
construction fro m Oleg Viro's 1983 Leningrad thesis (see, e.g., [Vir84]) and later extensions by 
Bernd Sturmfels | Stu94 |. A bit later, Sturmfels offered US$500 for a proof that Itenberg and Roy's 
proposed upper bound held for the following family of 2 x 2 systems of type (4,4): 



(-x 5 + a iy 5 + a 2 x 3 y 5 + a 3 x 6 y 8 , -y 5 + b lX 5 + b 2 x 5 y 3 + b 3 x s y 6 ), 



(3) 



where 01,02,03,61,62,63 > 0. The Itenberg-Roy conjecture yields an alleged upper bound of 3 for 
this family, and Jeff Lagarias and Thomas Richardson later won Sturmfels' prize by showing that 
this bound in fact holds [LR97 . However, the Itenberg-Roy conjecture was later invalidated by the 
2x2 system 

{y-x-l,y 3 + Q.0lx 3 y 3 -9x 3 -2) (4) 

found by the left and right authors (Li and Wang) : this system has exactly 3 roots in , whereas the 
conjectured bound would have only been 2 [LW98]. Perhaps the most important counter-example 
in this growing theory is Haas' recent counter-example to Kushnirenko's Conjecture: It is 



108 



l.Lr 



54 



1.1a: 



o4 



l.Ln 



(5) 



which has 5 (> 4 = 2 ■ 2) roots in the positive quadrant [Haa02]. Jan Verschelde has also verified 
numerically via his software package PHCPACK [ Ver99 that there are exactly 108 2 = 11664 complex 
roots, and thus (assuming the floating-point calculations were sufficiently good) each root is non- 
degenerate by Bezout's theorem on intersections of complex hypersurfaces |Sha77, ex. 1, pg. 198]. 

As for asymptotic behavior, it is still unknown whether even IC / (2 1 fi) is polynomial in [i: even 
the special case of 2 x 2 fewnomial systems of type (3, m) is still open. Note also that this kind of 
polynomiality requires the number of variables to be fixed: the system (x\ — "ix\ + 2, . . . , x 2 — 3x n +2) 
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shows us that J\f'(3, . . . , 3) is already exponential in n. More to the point, it is also unknown whether 



a simple modification of Kushnircnko's conjectured bound (e.g., increasing the formula by a constant 
power or a factor exponential in n) would at last yield a true, sharp, and general improvement of 
Khovanski's Theorem on Real Fewnomials. The 2k x 2k fewnomial system 



1.1*5* 



l.llrf 



l.lxf 



1.1a*), 



than 



thanks to Haas' counter-example, easily shows that one needs at least an extra multiple no smaller 
if some Kushnirenko-like bound is to be salvaged. 
Another question with even deeper implications is whether there is an algorithm for approximat- 
ing the real roots of a fewnomial system whose complexity depends mainly on the number of real 
roots. Since all current algorithms for real-solving have complexity bounds essentially matching the 
analogous bounds for solving over the complex numbers, a positive answer would yield tremendous 
speed-ups, both practical and theoretic al, for r eal-solving. However, little is known beyond the 
special cases ofnxn bino mial sy stems [ RojOOa , Main Thm. 1.3] and univariate polynomials with 
3 monomial terms or less [RY02| : For these cases, one can indeed obt ain alg orithms beating the 
known lower bounds |Ren89| for solving over the complex numbers, and [ RY02| also shows that one 
can at least find the isolated inflection points and vertical tangents of a trinomial curve about as 
quickly. 

Let us conclude our introduction with a recent number-theoretic parallel: It has just been shown 
by the middle author [ R,oj02 that the number of geometrically isolated^ roots in C n of any /it- 
sparse k x n polynomial system, over any p-adic field C, is no more than 1 + (Ccn(iJ> — n) 3 log /i) , 
where Cc is a constant depending only on C (see also |Roj01] and the references therein for earlier 
results in this direction). In particular, since QCQ2, one thus obtains a bound on the number of 
isolated roots in C n which is polynomial in /i for fixed n, with C now any fixed number field. One 
should note that p-adic fields, just like R, are complete with respect to a suitable metric. So there 
appears to be a deeper property of metrically complete fields lurking in these quantitative results. 

Remark 2. Domenico Napoletani has recently shown that to calculate J\f' (mi, . . . , m n ) for any given 
(mi, . . . , m n ), it suffices to restrict to the case of integral exponents ^NapOlj. Here, we will bound 
Af(n + 1, . . . , n + 1, m) directly, in the aforementioned cases, without using this reduction, o 



1.3 Organization of the Proofs and Obstructions to Extensions 



Section ^| provides some background and unites some simple cases where Kushnirenko's conjectured 
bound in fact holds, and the equalities lC'(n, fi) =/C(n, fi) and TV' (mi, . . . , m n ) =N(mx, . . . , m„) are 
true. We then prove Theorem [l] in Sections [| and |^, and prove Theorem || in Section Proving 
the (restricted) upper bound on Af(n + 1, . . . ,n + l,m) turns out to be surprisingly elementary, 
but lowering the upper bound on 7V(3, 3) to 5 then becomes a more involved case by case analysis. 
Section f\ then applies a variant of the momentum map from symplectic/toric geometry (see, e.g., 
| pma70t pou70| and ]Ful93| , Sec. 4.2]) to prove Theorem |. 

Section |] gives an alternative geometric proof that 7v(3, 3) < 6. We include this second proof for 
motivational purposes since it appears to be the first known improvement over A/ 7 (3, 3) < 248832, 
and since it is the only approach we know which yields part (c) of Theorem |l[ 

The reader should at this point be aware that our results can of course be combined and inter- 
weaved to generate much more complicated examples (with more monomial terms, more complicated 
supports, and more variables) which admit upper bounds on the number of roots in R" significantly 
sharper than Khovanski's Theorem on Real Fewnomials (see, e.g., Theorem || and the paragraph 
after in the next section). Nevertheless, it should also be clear that there are still many simple 
fewnomial systems where nothing better than Khovanski's bound is available, e.g., the exact values 

3 A root is geometrically isolated iff it is a zero-dimensional component of the underlying zero set in C n , where C 
is the algebraic closure of C. 
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of TV' (4, 4) and A/"(4,4) remain unknown. So let us close with some brief remarks on the obstruc- 
tions to extending Theorem [l] to more complicated fewnomial systems. In particular, the two main 
techniques we use are (A) a recursion involving derivatives of certain analytic functions, and (B) an 
extension of Rolle's Theorem (cf. Section ||) to intersections of lines with certain fewnomial curves. 

Our technique from (A) succeeds precisely because the underlying recursion stops in a number of 
steps depending only on m and n. In particular, while one can apply the same technique to certain 
slightly more complicated systems (cf. the proof of part (b) of Theorem [l] in Section ^), applying 
the same technique to a system of type (4, m) results in a much more complicated recursion which 
won't terminate without strong restrictions on the exponents; and even then the number of steps 
begins to depend on the exponents. The geometric reason for this is that we in essence project our 
roots to a line to start our recursion, and such projected roots appear to satisfy sufficiently simple 
equations just for the systems defined in part (b) (see Remark |J of the next section). 

Our technique from (B) succeeds for the systems (/i,/2) coming from (*) precisely because (i) 
Z + (fi) is diffeomorphic to a line in a very special way, and (ii) the equations arising from checking 
inflection points and vertical tangents of Z+(f2) have a fewnomial structure very similar to that of 
fi. In particular, for the systems in (*), we construct our stated bound by a special application 
of Bernstein's Theorem | BKK76 in the 2x2 case. However, increasing the number of variables of 
p (i.e., the number of monomial terms of f%) leaves us with a system of equations apparently not 
reducible to Bernstein's Theorem. 

Nevertheless, we suspect that there are many similar improvements to Fewnomial Theory over 
R which are quite tractable, and we hope that our paper serves to inspire more activity in this area. 



2 The Pyramidal, Simplicial, and Zero Mixed Volume Cases 

Let us first note some simple inequalities relating the quantities JC' , /C, A/"', and Af. 
Proposition 1. We have (/i — l) n </C'(n, /z) < /C(n, n), 

M' (mi, . . . , m n ) <KJ 'in, mi H + m n — n + 1) <M'(m\ H V m n — 2n + 2, . . ., mi H + m n — 2n + 2), 

v V ' 

n 

and 

N(mi, . . . , m n ) </C(n, mi + ■ ■ ■ + m n — n + 1) <N(nii + • • • + m n — In + 2 mi + ■ ■ ■ + m n — 2n + 2) . 

n 

where we set Af(mi, . . . , m n ) =A/ 7 (mi, . . . , m n ) = if any mi is negative. In particular, by Theorem 
|, we thus have K' (2, 4) =/C(2, 4) = 5. ■ 

Indeed, the last two "left-hand" inequalities follow simply by dividing each fi by a suitable monomial, 
while Gaussian elimination on the monomial terms of F easily yields the last two "right-hand" 
inequalities. 

Let us next give a simple geometric characterization of certain fewnomial systems that admit 
easy root counts. 

Definition 5. Let us call any collection Li S • • • ^L n =W n of n non-empty subspaces ofW 1 (so that 
dim Li = i for all i) a complete flag. Noting that any polytope in R™ naturally generates a subspace 
ofW via the set of linear combinations of all differences of its vertices, let F — (/i, . . . ,/„) be annx 
n fewnomial system and, for all i, let Li be the linear subspace so generated by Newt(/i). We then say 
that F is pyramidal iff the Newton polytopes of F generate a complete flag. Finally, letting A: = [ojj] 
be any realnxn matrix, x:=(xi,.. .,x n ), y:={y\,.. .,y n ), and y A := (y^ 11 ■ . ■ ,yl ln ■ ■ ■ y^ nn ), 

we call any change of variables of the form x = y A a monomial change of variables, o 

For example, the systems from ( |l|) ( cf. Section IT) are pyramidal, but systems (||) (cf. Section 
1.1), (Q), (|J), and (||) (cf. Section p~^) are all non- pyramidal. Note in particular that all binomial 



systems are pyramidal, but a 2 x 2 fewnomial system of type (3, 3) certainly need not be pyramidal. 
Pyramidal systems are a simple generalization of the so-called "triangular" systems popular in 
Grobner-basis papers on computer algebra. The latter family of systems simply consists of those F 
for which the variables can be reordered so that for all i, fi depends only onx\,...,Xi. Put another 
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way, pyramidal systems are simply the image of a triangular system (with real exponents allowed) 
after multiplying the individual equations by arbitrary monomials and then performing a monomial 
change of variables. 

Recall that an analytic subset of a domain U C R™ is simply the zero set of an analytic function 
defined on U. We then have the following fact on monomial changes of variables. 

Proposition 2. If x:=(xi, . . . , x n )GR" and A is a real invertible n x n matrix, then (x A ) A =x 
and the monomial map defined by x t-y x A is an analytic automorphism of the positive orthant. 
In particular, such a map preserves smooth points, singular points, and the number of compact and 
non-compact connected components, of analytic subsets of the positive orthant. Furthermore, this 
invariance also holds for real m-nomial zero sets in the positive orthant. ■ 

The assertion on analytic subsets follows easily from an application of the chain rule from calculus, 
and noting that such monomial maps are also diffeomorphisms. That the same invariance holds 
for m-nomial zero sets follows immediately upon observing that the substitution (x\ , . . . , x n ) — 
(e zi , . . . , e Zn ) maps any n-variate real m-nomial to a real analytic function, and noting that the map 
defined by (ti, . . . ,t n ) i— *■ (e* 1 , ...,e tn ) is a diffeomorphism from R™ to R". 

Remark 3. The real zero set of x\ +X2 — 1, and the change of variables (xi,X2)= ^^,2/12/2^ > show 
that the number of isolated inflection points need not be preserved by such a map: the underlying 
curve goes from having no isolated inflection points to having one in the positive quadrant, o 

We will later need the following analogous geometric extension of the concept of an over- 
determined system. 

Definition 6. Given polytopes Pi, . . . , P„cR n , we say that they have mixed volume zero iff for 

some d G {0, . . . , n — 1} there exists a d-dimensional subspace of R" containing translates of Pi for 
at least d + 1 distinct i. o 

The mixed volume, originally defined by Hermann Minkowski in the late 19— century, is a nonneg- 
ative function defined for all n-tuples of convex bodies in R™ , and satisfies many natural properties 
extending the usual n-volume. The reader curious about mixed volumes of polytopes in the context 
of solving polynomial equations can consult [ BZ88| , Roj99[ ] (and the references therein) for further 



discussion. A simple special case of an n-tuple of polytopes with mixed volume zero is the n-tuplc 
of Newton polytopes of an n x n fcwnomial system where, say, the variable Xi does not appear. 
By multiplying the individual m-nomials by suitable monomials, and applying a suitable monomial 
change of variables, the following corollary of Proposition || is immediate. 

Corollary 4. Suppose F is a fewnomial system, with only finitely many roots in the positive orthant, 
whose n-tuple of Newton polytopes has mixed volume zero. Then F has no roots in the positive 
orthant. ■ 

Indeed, modulo a suitable monomial change of variables, one need only observe that the existence of 
a single root in the positive orthant implies the existence of an entire ray of roots (parallel to some 
coordinate axis) in the positive orthant. 

We will also need the following elegant extension of Descartes' Rule to real exponents. It's proof 
involves a very simple induction using Rolle's Theorem (cf. the next section) and dividing by suitable 



monomials [Kho91| — tricks we will build upon in the next section. 



Definition 7. For any sequence (ci, . . . , c m ) el™, its number of sign alternations is the number 
of pairs {1, . . . , to} such that j < f , CjCf <0, and Ci = when j <i<j' . o 

Univariate Generalized Descartes' Rule of Signs (UGDRS). Let cj., a\, . . . , c m , a m be any 

real numbers with a\ < ■ ■ ■ < a m . Then the number of positive roots of Y^L 1 cix\ l is at most the 
number of sign alternations in the sequence (ci, . . . , c m ). In particular, KJ(1, m) =/C(l, to) =J\f'{rri) = 
jV(to) = m—l. I 
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As a warm-up, we can now prove a stronger version of Kushnirenko's conjecture for certain 
fundamental families of special cases. In particular, we point out that aside from the domain of 
Theorem [j], the equalities /C'(n, fx) = IC(n, fi) and A/ 7 (mi, . . . , m n ) — Af(mi, . . . , m„) appear to be 
known only for the cases stated in UGDRS and assertions (0), (2), and (4) below. 

Theorem 4. Suppose F is an n x n fewnomial system of type (mi, . . . , m n ) (so mi, . . . , m n >l) 

and consider the following independent conditions: 

(a) The n-tuple of Newton polytopes of F has mixed volume zero. 

(b) All the supports of F can be translated into a single set of cardinality <n+ 1. 

(c) F is pyramidal. 

Then, following the notation of Theorem |J, we have: 

0. J\f(mi, . . . , m n ) is 0, 1, or nr=i( TO « — -0 */ we respectively restrict to case (a), (b), or (c). 
Also, in all these cases, A/"' (mi, . . . , m n ) =A/"(mi, . . . , m n ). 

1. If (a), (b), or (c) hold then [F has infinitely many roots F has no isolated roots]. 

2. M' '(mi, TO2, . . . , m n ) =Af(mi, m 2 , . . . , m n ) = some mi is < 1 

3. m\=2 [Af(mi,m 2 , ■ ■ ■ ,m n )—N{m 2 , ■ ■ ■ ,m n ) and Af' (m 1 ,m 2 , ■ ■ ■ , m n ) =A/"'(m 2 , . . . ,m n )]. 
In particular, Af'(2, . . . , 2) =J\f(2, . . . , 2) = 1. 

4- /C'(n, fi) =/C(n, /x) < 1 jtx< n + 1, and equality holds iff fi = n + 1. 

One should of course note the obvious fact that J\f and A/ 7 are symmetric functions in their arguments. 
Note also that conditions (a), (b), or (c) need not hold in assertions (2)-(4). 

Proof of Theorem ^: First note that the Newton polytopes must all be non-empty. The case (a) 
portion of assertions (0) and (1) then follows immediately from Corollary^. Note also that the case 
(a) portion of assertion (0) implies the "<^" direction of assertion (2) , since the underlying n-tuple 
of polytopes clearly has mixed volume zero. The direction of assertion (2) then follows easily 

from our earlier examples from Section [l[ The case (b) portion of assertions (0) and (1) follows easily 
upon observing that F is a linear system of n equations in n monomial terms, after multiplying the 
individual equations by suitable monomial terms. We can then finish by Proposition ||. 

To prove the case (c) portion of assertions (0) and (1), note that the case n=l follows directly 
from UGDRS. For n > 1, we have the following simple proof by induction: Assuming the desired 
bound holds for all (n — 1) x (n — 1) pyramidal systems, consider any n x n pyramidal system F . 
Then, via a suitable monomial change of variables, multiplying the individual equations by suitable 
monomials, and possibly reordering the fi, we can assume that f\ depends only on x\. (Otherwise, 
F wouldn't be pyramidal.) We thus obtain by UGDRS that f\ has at most mi — 1 positive roots. 
By back-substituting these roots into F' :— (f 2 , ■ ■ ■ , f n ), we obtain a new (n' — 1) x (n'— 1) pyramidal 
fewnomial system of type {m' 2 , ■ ■ ■ , m' n , ) with n' <n and m' 2 < m 2 , . . . , m' n , < m n i . By our induction 

hypothesis, we obtain that each such specialized F' has at most n"=2( m i — -0 isolated roots in the 
positive orthant, and thus F has at most n"=i( TO i — -0 isolated roots in the positive orthant. (We 
already saw in the introduction that this bound can indeed be attained.) 

Our recursive formulae for A/"' (2, m 2 , . . . , m n ) and A/"(2, m 2 , . . . , m„) from assertion (3) then follow 
by applying just the first step of the preceding induction argument, and noting that Proposition ^ 
tells us that our change of variables preserves non-degenerate roots. 

Assertion (4) follows immediately from cases (a) and (b) of assertion (0). ■ 
One can of course combine and interweave families (a), (b), and (c) to obtain less trivial examples 
where we have exact formulae for J\f{m\, . . . , m n ) and IC(n, /i). More generally, one can certainly 
combine theorems [j] and ^ to obtain bounds significantly sharper than Khovanski's Theorem on Real 
Fewnomials, free from Jacobian assumptions, for many additional families of fewnomial systems. 
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3 Substitutions and Calculus: Proving Theorem [j] Minus 
Part (c) 

Let us preface our first main proof with some useful basic results. 

Lemma 1. Let F = (/i, . . . , f n ) be any n x n fewnomial system of type (mi, . . . , m n ) with m\ 
I + dimNewt(/i). Then there is another n x n fewnomial system G — (gx, . . . ,g n )> also of type 
(mi, . . . ,m n ), such that G has the same number of non- degenerate (resp. isolated) roots in Wt as 
F, gi := 1 ± xi ± • • • ± x mi _i (with the signs in g\ not all "+") and, for all i, gi has 1 as one of 
its monomial terms. In particular, for mi =3, we can assume further that gi :=1 — X\ — x^. 

Proof: By dividing each /, by a suitable monomial term, we can assume that all the fi possess 
the monomial term 1. In particular, we can also assume that the origin O is a vertex of Newt(/i). 
Note also that the sign condition on g\ must obviously hold, for otherwise the value of g\ would be 
positive on the positive orthant. (The refinement for m = 3 then follows by picking the monomial 
term one divides f\ by more carefully.) So we now need only check that the desired canonical form 
for gi can be attained. 

Suppose fx := 1 + cix ai + ■ • • + c mi ^ix am i- 1 . By assumption, Newt(/i) is an TOi-simplex with 
vertex set {O, a%, . . . , a mi _i}, so oi, . . . , a mi -i are linearly independent. Now pick any a mi , . . . , a n G 
K™ so that ai, . . . , a n are linearly independent. The substitution x i— > x a (with A the n x n matrix 
whose columns are ax,..., a„) then clearly sends fx i— > 1 + CiXi + • ■ • + c mi _ix mi _i, and Proposition 
U tells us that this change of variables preserves degenerate and non-degenerate roots in the positive 
orthant. Then, via the change of variables (xi, . . . , x mi -x) >— * (xi/|ci|, . . . ,x mi -x/\cm x -x\)-> we 
obtain that g\ can indeed be chosen as specified. (The latter change of variables preserves degenerate 
and non-degenerate roots in the positive orthant for even more obvious reasons.) ■ 

Recall that a polynomial pE M[xi, . . . , x n ] is homogeneous of degree D iff p(axi, . . . , ax n ) = 
a D p(xi, . . . ,x„) for all o£R. 

Propositions. Suppose p G R[Si, S n ] is homogeneous of degree D>0. Also let otx,Ux,Vx, ... ,a n ,u n ,v n £ 
M. Then there is a homogeneous q£&.[Si, . . . , S n ], either identically zero or of degree D + n — 1, such 

that j- t (p{ui + v x t, ...,u n + v n t) n™=i( u j + v jt) a:> J =q(ux+Vxt, ■ ■ .,u n + v n t) ]l"=i( u i + Vjt)^' 1 . 

Proof: By the chain-rule, ^ (p{u\ + v\t, ...,u n + v n t) \~\™ =1 (uj + Vjt) aj ^j is simply 

(E"=i v jPi( u i +vit,. ..,«„ + v n tj\ \ W = x( u i+ v i t ) a:i ) +p(ux + vxt,...,u n + v„t) fciLi ^* '^iH+tT ' ) 
where pi denotes the partial derivative of p with respect to Si. Factoring out a multiple of 
n™ =1 (itj + Vjt) a i~ from the preceding expression, we then easily obtain that we can in fact take 

q(Si, ...,S n ) = (vxpx(Sx, <5n)H Vv n p n (Si, S n )){Sx ■ ■ ■ S n )+p(Sx, ■ ■ ■ , S n ) fe"=i a i v i Sl s f° ) ■ 

So we are done. I 

Rolle's Theorem. (1691) Let g : [a,b] — ► K be any continuous function with a derivative g' 
well-defined on (a,b). Then g has r roots in [a,b] ==>■ g' has at least r — 1 roots in (a,b). ■ 

Lemma 2. Let m>2. Then for any real c\,u\,v\, ... ,c m ,u m ,v m and [ay], the function 

m n 

f(t) : »1K ' ''.• / ; " 

i=i j=i 

has no more than n + • • • + n m ~ 1 roots in the open interval I :={t €R+ | Uj + Vjt>0 for all j}. 

Furthermore, for any ax , ■ ■ ■ , a n G K, / has exactly r roots in I implies that there exist c\ , . . . , c m G 
K such that 

f(t) '■— T^iLi Ci Ilj=i( u i + Vjt) aij has at least r roots in I, no root of f in I is degenerate, and 
no root of f in I is an isolated root of ^Iljli( u j + v j^) aj ^j fj ■ 
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Proof: Throughout this proof let us consider only those roots lying in the open interval I and 
assume that / has exactly r roots in /. We will in fact prove a stronger statement involving an extra 
parameter D and then derive our lemma as the special case D = 0. 
First note that if 



: = y^Pijui + «i*> ■■■,u n + v n t) Y[(uj + Vjt) a * j 

i=l 3=1 

for some homogeneous polynomials pi, ■ ■ ■ ,p m of degree D, then 

m n 

go(t) i^p^ui + vit, ...,u n + v n t) + ^pi(ui + vit, ...,u n + v n t) + v 3 t) a '^ ai] 

1=2 j=l 

has the same number of roots in / as g. In particular, using D + 1 applications of Rolle's Theorem 
and Proposition ||, it is clear that g\ := gy D+1 ^ has at least r — (D + 1) roots, and we can in fact 
write 

m— 1 n 

i=l j=l 

for some array [a^] and homogeneous polynomials qi, . . . , q m of degree D + [D + l)(n — 1). 

Now let j4(m, -D) denote the maximum number of isolated roots of g in the interval /. By what 
we've just observed, we immediately obtain the inequality 

A(m, D) < A(m - l,nD + n - 1) + D + 1, 

valid for all m > 2, n > 1, and D > 0. That D) < D is clear, so one can then begin to bound 
Aim, D) for general m by recursion. A simple guess followed by an easy proof by induction yields 

A(m, D) < (1 + n + ■ ■ ■ + n m )(D + 1) - 1, 

which is valid for all m, n> 1 and D>0. So the first assertion is proved. 

To prove the second part, note that the first part of our lemma implies that / has only finitely 
many critical values (i.e., values f(x) with f'(x) — 0) — no more than n + ■ ■ ■ + n" 1 ^ 1 , in fact. 

Similarly, for any a±, . . . , a m el, there will only be finitely many roots for ^njli( u j + vjt) aj ^ f'J , 

unless this function is identically zero. In the latter case, no root of f ^11^=1 ( u j + v jt) aj ^J /') is 

isolated. So let us pick a±, . . . , a m £R so that ^njli( u i + v jt) aj ^J /') i s not identically zero. 

Note then that for all 8 £ R* with \S\ sufficiently small, / — ^n?=i( u j + Vjt) aij will have no 
degenerate roots in / and no toots in 7 ,na k i„ g ((n™ ^ + v^jf)' vanish. We can in tact 

guarantee that / — S YYj=i ( u j + Vjt) aij will also have at least r non-degenerate roots in I as follows: 
Let n + (resp. n_) be the number of roots t el of / with /'(i) = and /"(f) > (resp. /"(i) < 0). 
Clearly then, for all 6 € R* with \6\ sufficiently small, / — 5Y^^ =1 (uj + Vjt) aij will have exactly 
r + n~ — n+ or r + n + — n_ roots in /, according as 6 > or 5 < 0. (This follows easily upon 
dividing through by Y[j=i ( u j + Vjt) ai i .) So let <5 be sufficiently small, and of the correct sign, so 
that / — fiYYj=i( u j + Vjt) aij has at least r roots in /, no degenerate roots, and no roots making 

((n™iK +V) Qj )/')' vanish. 

To conclude, simply let c\ = c\ — S and c, :=c^ for all i> 2. ■ 

Proof of Theorem Q (Minus Part (c)): We will reduce part (a) to part (b), prove part (b), and 
then refine our argument until we obtain A/"'(3, 3) =J\f(3, 3) = 5. 

First note that in part (a), a simple Jacobian calculation reveals that the only way that Z + (fi) 
can be degenerate is if /i is the square of a binomial. (Indeed, if Newt(/i) is a triangle then Lemma 
|l| implies that Z + (fi) is diffeomorphic to a line.) Part (0) of Theorem [|then shows that our bound 
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from (a) is easily satisfied in the special case where f\ is a trinomial with Newt(/i) a line segment, 

so we can assume Newt(/i) is a triangle. Since 2 + 4 H h 2 m_1 = 2™ - 2, it then clearly suffices 

to prove part (b). 

To prove part (b), first note that UGDRS implies the case n = 1, so we can assume n > 2. 
Also, from the last paragraph, we already know that we can assume Vol ( Newt (/i)) > when n = 2. 
Since F has no isolated roots when n > 2 and the mixed volume of Newt(/i), . . . , Newt(/„_i) 
is zero (via part (0) of theorem ^ again), we can assume henceforth that the mixed volume of 
Newt(/i), . . . , Newt(/„_i) is positive. Since the supports of /i, . . . , f n -i can then all be translated 
into the vertex set of an n-simplex, Proposition [2| tells us that we can assume in addition that 
fii ■ ■ ■ i fn—i are affine functions of x\, . . . , x n . Letting f n (xi, . . . , x n ) — YnLi °i x j %3 > we can 
then simply solve for X2, ■ ■ ■ ,x n as functions of xi by applying Gaussian elimination to the first n— 1 
equations. Substituting into the last equation we then obtain a bijection between the roots of F 
in the positive orthant and the roots of f(t) := Y^iLi c % nj=i( u j + Vjt) aii in the interval I := {t E 
K + | Uj + Vjt>0 for all j}, where Ui,vi, . . . ,u n ,v n are suitable real constants. 

A simple Jacobian calculation then yields that (Ci, ■ • ■ , ( n ) is a degenerate root of F iff 

. . , . . . =0and/(d)=0 

and the above assertion is clearly true iff f'(Ci) = / (Ci) = 0. So degenerate (resp. non-degenerate) 
roots of our univariate reduction correspond bijectively to degenerate (resp. non-degenerate) roots 
of F. Part (b) then follows immediately from Lemma ||. 

To now prove that A/"(3, 3) = 5, thanks to Haas' counter-example, it suffices to show that jV(3, 3) < 
6. To do this, let us specialize our preceding notation to (m,n) = (3,2), (c\,C2) = (—A,—B), 
(tii, vi,v,2, V2) = (0, 1, 1, — 1), and (an, a 12, 021, 022) = (a, b, c, d), for some a, 6, c, deR and positive ^4 
and B. (Restricting A, B, u%, v\, U2, V2 as specified can easily be done simply by dividing /2 by a 
suitable monomial term, as in the proof of Lemma |l|.) In particular, the open interval / becomes 

(0,1). 

By using symmetry we can then clearly reduce to the following cases: 
A. a, b, c > and d < B. a, c > and 6, d < 

C. a, b > and c, d < D. a, 6, c, d > 

E. a, b, c, d < F. a > and 6, c, d < 

G. a, d > 0, b, c < H. At least one of the numbers a, 6, c, d is zero. 

Let g(t) := jjt 1-c (l — t) 1 ^ d f'(t). Then Lemma |^ and our earlier substitution trick tells us that 
it suffices to show that any 

f(t) := 1 - At a (l - t) b - Bt c (l - t) d , 

with all roots non-degenerate and no root of / an isolated root of g', always has strictly 
less than 6 roots in the open interval (0, 1). So let r be the maximum number of roots in (0, 1) of 
any such /. 

Let us now prove r < 6 in all 8 cases: 

A. a, b, c > 0, d < 0: 

Let Q(x) = 1 — Ax a (l — x) b and R(x) — Bx c (l — x) d . The roots of / may be regarded as the 
intersections in the positive quadrant of the parametrized curves y = Q(x) and y = R(x). Since 
lim :I ,^o+ Q( x ) — lj lini^-,]- Q( x ) = L hm a ._ +0 + R(x) = 0, and lim^^i- i?(x) = 00, it is easy 
to see via the Intermediate Value Theorem of calculus that the number of intersections must 
be odd. (One need only note that / = Q — R and that the signs of /' at the ordered roots of 
/ are nonzero and alternate.) So r<6. 

B. a, c> 0, 6, d < 0: 

Almost exactly the same argument as case A will work here. The only difference here is that 
linx^!- Q(x) = — 00. 

C. a, b > 0, c, d < 0: 
See Lemma || below. 
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D. a, b,c,d> 0: 

See Lemma ^ below. 

E. a, b,c,d < 0. 

Multiplying f(t) by t max ^~ a ~ c ^(l - fj™ax{-b-d} ^ we can i mmec iiately reduce to case D. 

F. a > 0, b, c, d < 0: 
See Lemma ^ below. 

G. a,d > 0, b,c < 0: 
See Lemma fj] below. 

H. At least one of the numbers a, b, c, d is zero: 

Use Lemma |^ below, noting that our hypotheses here imply that either F or F is a quadratic 
polynomial. 

This concludes the proof of Theorem [j], except for part (c) , which we will complete in Section ||. ■ 

Remark 4. Note that while we can attempt the same substitution trick for more complicated F, the 
complexity of the resulting recursion (involving derivatives and Rolle's Theorem) increases substan- 
tially. For instance, applying our proof in the special case where n — 2 and fi(x, y) — 1 + x + cx a — y 
unfortunately results in taking a number of derivatives which depends on a, thus obstructing a bound 
on the number of roots which is independent of the exponent vectors, o 

We now detail the lemmata cited above. 

Lemma 3. Following the notation of the proof of Theorem [J, recall that 

g{t) := ^t a - c (l - t) b - d (-a(l - t) + bt) - c(l - t) + dt and that r is the number of roots of f(t) := 
1 — At (\ — t) h — Bt c (l — t) d in the open interval (0, 1), where f has no degenerate roots and no root 
of f is an isolated root of g' . Also let 

F(u) := ~a{a-c){a-c~\)u 3 + (a - c) [2a(6 - d + 1) + b(a - c + l)]u 2 + (d - b) [a(b - d + 1) + 2b(a - 
c + l)]u + b(b - d)(b -d-l), and 

F(u) := -c(c-a)(c-a-l)u 3 +(c-a)[2c(d-b+l) + d(c-a+l)]u 2 +(b-d)[c(d-b+l) + 2d(c-a + 
l)]u + d(d— b)(d—b—l). Finally, let N (resp. M) be the maximum number of non-degenerate roots 
in (0, 1) of g (resp. the maximum of the number of positive roots of F and F), over all (a, b, c, rf)eR 
and (A,B)eWL 2 + . Then r - 3<N - 2<M<3. 

Proof: Just as in the proof of Lemma ||, we easily see by Rolle's Theorem and division by suitable 
monomials in t and 1—t that r— 1 is no more than the number of roots in (0, 1) of g. So r— 1 < N. Note 
also that, in a similar way, r— 1 is no more than the number of roots of g(t) := ^t c ~ a (l — t) d ~ b g(t) in 
(0, 1), and the latter function has the same number of roots (all of which are of course non-degenerate) 
in (0, 1) as g. 

To conclude, simply note that for suitable a, /3, 7, JsR, we have that F(=-^) =t a (l — t)^g"(t), 
F(^-r^) =i 7 (l — t) s g"(t), and both expressions are cubic polynomials in t. So, by our preceding 
trick again, N — 2 < M, and thus r — 3 < M. That M < 3 is clear from the fundamental theorem of 
algebra. ■ 

Lemma 4. Following the notation of Lemma let T(x) := ^x a ^ c (l — x) b ~ d (bx — a(l — x)), 
S(x) := c- (e+ d)x, t(x) := § x c - a {\ - x) d ~ b {dx - c(l - x)), and S{x) := a - (a + b)x. Then 
[a.b > and c.d < 0] =>• r < 6. 

Proof: By Lemma [| we are done if M < 3 or N < 5. So let us assume M = 3 to derive a 
contradiction. By Descartes' Rule of Signs (see Section ^ for a generalization), the coefficients of 
F(u) or F(u) (ordered by exponent) must have alternating signs. Thus, since a, a — c, b, b — d > 0, 
we have that a — c — 1 and b — d — 1 must have the same sign. We then need to discuss two cases: 

• a — c — 1 < and b — d — 1 < 0: 

This implies c — a + 1 > and d — b + 1 > 0. Consequently, the coefficients of u 3 and u 2 in 
F(u) and F(u) are all positive — a contradiction. 
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• a — c — 1 > and b — d — 1 > 0: 

The roots of g in (0, 1) can be regarded as intersections of y = T(x) and y = S(x), for 0<x<l. 
Since T({0, 1}) = 0, — a(l — a;) + 6a; = (a + b)x — a is strictly increasing, and —a < 0, we 
must have that there is a smallest positive local minimum Co of T with T(cq) < 0. Thus 
for x near c , T"(x) > 0. Since T"(x) < for < x <C 1, there is c* <E (0, c ) such that 
T"(c*) = 0. Let (xi, j/i), . . . , (xk,Uk) be the intersection points of y = T(x) and y = S(x) with 
xi < < ■ • ■ < Xif. (A simple Jacobian calculation shows that (xi,yi) is a degenerate root 
<^=^ X{ is a degenerate root of g. So every (xi,yt) is in fact a non-degenerate root.) Then 
for all i G {1, . . . ,K — 1} there is a c; 6 (xi,Xj+i) with T'(a) = — (c + d) > 0, and for all 
i G {1, . . . , K — 2} there is a di € (cj, Cj+i) with T"(di) = 0. Note that c < ci. Thus c* < d\ 
and therefore T"(x) = has at least K — 1 solutions. Since T" and F have the same number 
of positive roots (observing that T"(u)/F(u) is a monomial in u and 1 — u), we must have 
N- 1<K- 1<3. ■ 

Lemma 5. Following the notation of Lemma 0, a, 6, c, d > ==> r < 6. 

Proof: Again, by Lemma |[ we need only show that M<3 or N <5. So let us assume M = 3. Then 
by Descartes' Rule of Signs, (a — c) (a — c — 1) and (6 — d) (b — d — 1) in the coefficients of u 3 and u° 
in -F(u) must have the same sign. There are now four cases to be examined. 

• The signs of a — c, a — c — 1, b — d, and b — d — 1 are respectively +, — , +, and — : 
This makes the signs of coefficients of u 3 and u 2 of F(u) both positive. 

• The signs of a — c, a — c — 1, 6 — d, and b — d — 1 are respectively —,—,+, and +: 

Since b — d > 0, we have d — b < and d — b — 1 < 0. This makes the constant term of F(u) 
positive, and hence, the coefficients of u and u 2 of F{u) must respectively be negative and 
positive. That is, c(d- b + 1) + 2d(c - a + 1) < and 2c(d - 6 + 1) + d(c - a + 1) > 0. Thus, 
-c(d — b + 1) + d(c — a + 1) < 0. This is false, since 6 - d - 1 > and a - c - 1 < 0. 

• The signs of a — c, a — c — 1, 6 — d, and 6 — d — 1 are all negative: 

By Descartes' rule of signs, d — b — 1 and c — a — 1 in the coefficients of y 3 and y° of -F(y) 
must have the same sign. If both are negative, then coefficients of u 3 and u 2 of F(u) would 
both be negative. Thus d — b — 1 > and c — a — 1 > 0. It is easy to see that T{x) < for 
< x < 1 and f (x) >0for0< 1 - x < 1 and lim :E ^ 0+ f (x) = lim x ^i- T{x) = 0. Now let 
Lo = min{c | 1 >c>0, T(c)<0 and c is a local minimum of T} and 

f/o = max{c | 1 > c > L$, c is a local maximum of T}. Then for x near Lq, T"{x) > 0. 
Since f"(x) < for < x < 1, there exists Li G (0, L ) such that T"(Li) = 0. Similarly, there 
is a t/i G (U Q , 1) such that T"'(f7i) = Q. 

The roots of ^t c ~ a (l — t) d ~ h g can be studied via the intersections of y = T(x) and y — S(x), 
for < x < 1. Let (xi,?/i), . . . , (xk,yk) be these intersection points, where x\ <x 2 < • • • <Xfe. 
(A simple Jacobian calculation shows that (x^, j/j) is a degenerate root x, is a degenerate 
root of g. So every (xj,j/i) is in fact a non-degenerate root.) Then there are ci,...,Cfc_i 
with Ci G (xj, Xi+i) and T'(cj) = — (a + 6) < for all i G {1, . . . , k — 1}, and d%, . . . , dk-2 with 
di G (cj, Cj+i) and T"(di) = for alliG {1, . . . , fc — 2}. If xi >Lo, then L\ < d\. If x\ < Lq, then 
T(xi) < 0. This implies T(xi) < for alHG {1, . . . , k - 2}, since the slope -(a + b) of S(x) is 
negative. Therefore, Xk-2 < Uq and hence dk-2 <U\. So T"(x) = has at least fc — 1 solutions. 
Since T"(x) = and F(y) — Q have the same number of solutions, we have N— l<fc — 1<M = 3. 

• The signs of a — c, a — c — 1, 6 — d, and 6 — d — 1 are all positive: 

Since a — c — 1 > and b — d — 1 > 0, the proof follows almost exactly the same line of reasoning 
as the last case, by intersecting the graphs of T and S instead of T and S. ■ 

Lemma 6. Following the notation of Lemma 0, [a > and 6, c, d < 0/ => r < 6. 
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Proof: Once again, by Lemma |3|, it suffices to show that M < 3 or N < 5. So let us assume 
that M = 3. By checking the coefficients of u 3 and u° in F(u), Descartes' Rule of Signs tells us that 
a — c — 1 and (b — d)(b — d— 1) must have different signs. There are now three cases to be examined. 

• a — c — 1, 6 — d, and 6 — cf — 1 are all negative: 

Then the signs of the coefficients of both u 3 and u 2 in F(u) will all be positive. 

• The signs of a — c — 1, b — d, and b — d — 1 are respectively — , + , and +: 

Multiplying / by t~ c (l -t)~ d yields w(t) :=t" c (l - t)~ d - At a - C (l - t) b - d - B, where -c>0, 
a — c>0, — d> 1, and — d + 6>1. The roots of w in (0, 1) can be regarded as the intersections 
of the parametrized curves y — v(x) :— x~ c (l — x)~ d — Ax a ~ c (l — x) h ~ d and y = B. Let 
(xi, yi), . . . , (x n , y n ) be the intersection points of these two curves, where x\ < x 2 < ■ ■ ■ < x n . 
(A simple Jacobian calculation shows that (x$, j/j) is a degenerate root is a degenerate 

root of /. So every (xi, yi) is in fact a non-degenerate root.) Then for all i G {1, . . . ,n — 1} 
there is a Cj € (xj, Xj+i) such that u'(cj) = 0. Thus u' has at least n — 1 roots in (0,1). A 
straightforward computation then yields, 

v'(x) := Ax a - C ~ x {l - x) b - d -\-(a - c)(l - x) + (b - d)x) + x^-^l - x)- d -\-c(l - x) + dx), 
which clearly has the same number of roots in (0, 1) as 

t(x) := Ax a (l - x) b (-{a - c)(l - x) + (b - d)x) - c(l - x) + dx. 

Thus has at least n — 3 roots in (0, 1). Since 

t"(x)/A = x Q - 2 (l-x) b - 2 [-(a-c)a(a-l)(l-x) 3 + a((a + l)(&-d) + 2(6 + l)(a-c))x(l-x) 2 
-6((6 + l)(a - c) + 2(6 - d)(a + l))x 2 (l - x) + (6 - d)b(b - l)x 3 ], 
has as many roots in (0, 1) as 

P(u) := -(a - c)a{a - l)u 3 + a((a + l)(b - d) + 2(6 + l)(a - c))u 2 
-6((6 + l)(a - c) + 2(6 - d)(a + l))u + (6 - d)6(6 - 1) 

has positive roots. Since a — 1 <a — c — 1<0, the coefficients of it 3 and u° in P(it) are both 
positive. Thus P has at most 2 positive roots and we obtain n — 3 < 2. 

• The signs of a — c — 1, 6 — d, and 6 — d — 1 are respectively +, +, and — : 

Since a — c— 1 > and 6 — d> 0, it is easy to see that T(x) < for < x^; 1 and lim x ^i- T(x) = 
— oo. If T(x) has no local minimum, then y = T(x) and y = S{x) have at most one intersection 
point. Otherwise, let Co := min{c | l>c>0, c is a local minimum of T}. The rest of the 
proof is similar to that of Lemma ^. ■ 

Lemma 7. Following the notation of Lemma [a,d > and 6, c < 0/ =>■ r < 6. 

Proof: One last time, Lemma [| tells us that it suffices to prove that M < 3 or N < 5. So let's 
assume that M = 3. Checking signs of coefficients of u 3 and w° of both F(u) and F(u), Descartes' 
Rule of Signs tells us that a — c — 1 < and d — b — 1 < 0. On the other hand, the alternating signs 
of coefficients of it 2 and u 1 of F(u) yield 

2a(6-d+l) + 6(a-c + l) < and a{b - d + 1) + 26(a - c + 1) > 0. 

Thus, -a(6-d+l)+6(a-c+l) = a(d-l)+6(l-c) > 0. But this is impossible since d-l<d-l-6<0, 
1 - c>0, a>0, and 6<0. ■ 

Remark 5. When A = 1.12, B = 0.71, a = 0.5, 6 = 0.02, c = —0.05, and d = 1.8, there are exactly 
5 roots of 1 — Ax a (l — x) f ' — Bx c (l — x) d in (0, 1): They are, approximately. 

{0.00396494, 0.02986317, 0.4354707, 0.72522344, 0.99620026}. 
In particular, this example is nothing more than the univariate reduction from the proof of Theorem 
|^ applied to a small perturbation of Haas' counter-example, o 
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4 A Simple Geometric Approach, a Single Hard Case, and 
the Proof of Part (c) of Theorem [j] 

Let us begin with an extension of Rolle's Theorem to smooth curves in the plane. 
Lemma 8. Suppose C is a smooth 1- dimensional submanifold ofM 2 with: 

1. At most I inflection points that are isolated (relative to the locus of inflection points). 

2. At most N non-compact connected components. 

3. At most V isolated points of vertical tangency. 

Then the maximum finite number of intersections of any line with C isI + N + V + 1. 

Proof: Let S 1 be the realization of the circle obtained by identifying and it in the closed interval 
[0, 7r]. Consider the natural map <f> : C — > S 1 obtained by x i— > 9 X where 9 X is the angle in [0, it) the 
normal line of x forms with the xi-axis. We claim that any 9G S 1 has at most I + V + 1 pre-images 
under tf>. 

To see why, note that by assumption we can express C as the union of no more than I + V + 1 
arcs where (a) any distinct pair of arcs is either disjoint or meets at < 2 end-points, and (b) every 
end-point is either an isolated point of inflection or vertical tangency of C . (This follows easily by 
considering the graph whose vertices are the underlying inflection and vertical tangency points, and 
whose vertex adjacencies are determined by path-connectedness.) Calling these arcs basic arcs, it 
is then clear that the interior of any basic arc is homeomorphic (via </>) to a connected subset of 
5 1 \{0}. We then easily obtain that any 9&S 1 has at most I + V + 1 pre-images under <fi, since each 
such pre-image belongs to exactly 1 basic arc. 

Recall that a contact point of a curve C with a differential system ^ = G(t) is simply a point 
at which some solution of = G(t) has a tangent line in common with C. Now note that any line 
L m ■— {(xi,x n ) S M. 2 | miXi + m2X2 = mo} normal to C forms an angle of ArcTanf ^-J with the 
Xi-axis. Thus, the number of contact points C has with the differential system 

dxi 3x2 

-dT = m2 ' -dT = - mi 

is at most I + V + 1. By Rolle's Theorem for Dynamical Systems in the Plane (see, e.g., [Kho91, 
corollary, pg. 23]), we then obtain that the number of intersections of L m with C is at most I + N + 
V + 1, for any real (mo, mi, TO2) ^ (0, 0, 0). So we are done. ■ 

Remark 6. The bound from Lemma is tight in all cases. This is easily revealed by the examples 
in figure 1 below and their obvious extensions. In particular, one can simply append N — 1 disjoint 
lines to extend any example with N = l to N>1. o 

We are now ready to give a concise geometrically motivated proof of the nearly optimal bound 
A/"(3, 3) < 6. This "second" proof of A/"(3, 3) < 6 was actually the original motivation behind this 
paper and, via a trivial modification, yields the proof of part (c) of Theorem [l] as well. 
Short Geometric Proof of Af(3, 3)<6: Theorem || implies that we can assume that f\ and /2 
have Newton polygons that are each triangles. Lemma [l] of the last section tells us that we can 
assume that f\ = 1 ± X\ ± X2, so we need only check the number of intersections of a line with 
Z + {f2). In particular, since Z + (f2) is diffeomorphic to a line (thanks to Proposition ||), Theorem^ 
tells us that Z + {f2) has no more than 3 inflection points and 1 vertical tangent. By Lemma |^, we 
are done. ■ 

Proof of Part (c) of Theorem [j]: Via a change of variables almost exactly like that of Lemma 
[l] from the last section, we can assume that f\ = 1 ± x\ ± X2- From here, we proceed exactly as in 
our last proof, noting that here Z + (f2) instead has no more than Area(Newt(p)) isolated vertical 
tangents, 3Area(Newt(p)) isolated inflection points, and 2D non-compact components (thanks to 
Theorem |). ■ 
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FIGURE 1 Lemma fjgives a tight bound for N=l and (I, V) G{(0, 0), (3, 1), (4, 1), (3,2), (7, 5)}; 
and this generalizes easily to arbitrary (/, N, V). 

Fewnomial curves happen to admit a simple "fewnomial" description of their inflection points 
and singular points. This fact will be used here to prove our classification of when equality holds in 
our bound J\f(3, 3) < 5 (Corollary and in the proof of Theorem |^ in the next section. Let <9, := . 

Lemma 9. Suppose f : > R is analytic. Then [z is an inflection point or a singular point of 

Z+(f)] => {/(*)=0 and [dlf ■ (3 2 f) 2 - 2dtd 2 f ■ dif ■ d 2 f + d 2 f ■ (dif) 2 ] x = z -0} . In particular, 
in the case where fix) :=p(x ai , . . . , x am ) for some polynomial pGR[<Si, . . . , S m ] and a%, . . . , a m €K 2 , 
the above cubic polynomial in derivatives is, up to a multiple which is a monomial in (xi,x 2 ), a 
polynomial in x ai , . . . , x a ' n with Newton polytope contained in 3Newt(p). 

Proof: In the case of a singular point, the first assertion is trivial. Assuming d 2 fy^0 at an inflection 
point then a straightforward computation of d\x 2 (via implicit differentation and the chain rule) 
proves the first assertion. If $2/ = at an inflection point then we must have d\f 7^ 0. So by 
computing d 2 xi instead, we arrive at the remaining case of the first assertion. The second assertion 
follows routinely from the chain rule. ■ 

Let us now prove our polygonal classification of bivariate trinomial systems with maximally many 
roots in the positive quadrant. 

Proof of Corollary The segment case follows immediately from Corollary [|. For the remaining 
cases, Lemma |l| implies that we can assume f\ :— 1 ± x\ ± x 2 and f 2 :=1 + Axlx 1 ^ + Bx\x\ for some 
real A and B. In particular, it is easily verified that the number of edges of Pp and Pq are the 
equal. 

So let Si := Axlx^, S 2 := Bx\x^^ and let Z := Z + (f 2 ). Observe that Lemma ^ (along with a 
suitable rescaling of f 2 and the variables) tells us that we can bound the number of inflection points 
of Z by analyzing the roots of a homogeneous polynomial in (Si, S 2 ) of degree < 3. So let us now 
explicitly examine this polynomial in our polygonally defined cases. 

Clearly then, the triangle case corresponds to setting a — d > and b = c = 0. We then obtain 
that [x is an inflection point or a singular point of Z] ==> 1 + Si + S 2 = and Si + S 2 = 0. So 
Z has no inflection points (or singularities). It is also even easier to see that Z has no vertical 
tangents. So by Lemma ||, A/"(3, 3) < 2 in this case. To see that equality can hold in this case, simply 
consider F := (x\ + x 2 , - 25, x x + x 2 - 7), which has P F = Conv({(0, 0), (3, 0), (0, 3)}) and root set 
{(3,4), (4,3)}. 

For the quadrilateral case, we clearly have that Newt(/i) and Newt(/2) have exactly two inner 
edge normal vectors (with length 1) in common. So let Vi be the vertex of Newt(/i) incident to both 
the edges of Newt(/i) with these normals. Clearly then, we can assume that our above application 
of Proposition || (which simply involved dividing the fi by suitable monomial terms and performing 
an invertible monomial change of variables) gives us vi = O as well. So we can assume b = c = 
and a, d> 0. We then get the pair of equations 1 + Si + S 2 = and a(d — l)Si — d(a — 1)S 2 = 0, 
with a,d ^ {0, 1}. (If {a, d} n {0, 1} ^ then F, or a suitable pair of linear combination of F, 
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would be pyramidal and we would be done by Theorem ^.) So Z can have at most 1 inflection 
point. It is also even easier to see that Z has no vertical tangents. So by another application of 
Lemma ||, A/"(3, 3) < 4 in this case. To see that equality can hold in this case, simply consider the 
system [x\ - 3xi + 2,x\ - 3x 2 + 2), which has P F = Conv({0, (2,0), (2,2), (0,2)}) and root set 
{(1,1), (1,2), (2,1), (2,2)}. 

As for the pentagonal case, we can again assume (just as in the quadrilateral case) that our 
application of Proposition [2] placed the correct vertex of Newt(/i) at the origin. In particular, 
we can assume b = and a, c, d > 0. We then get the pair of equations 1 + Si + S 2 = and 
a 2 (d - l)Sf + a(ad - d - 2c)SiS 2 - c(c + d)5| =0, with ac(d - l)(c + d)^0. (Similar to the last 
case, it is easily checked that if the last condition were violated, then we would be back in one 
of our earlier solved cases.) However, a simple check of the discriminant of the above quadratic 
form in (Si, S 2 ) shows that there is at most 1 root, counting multiplicities, in any fixed quadrant. 
So, similar to the last case, we obtain J\f(3, 3) < 4 in this case. To see that the equality can 
hold in this case, simply consider the system {x\ — 7x 2 + 12,-1 + — x\), which has P F = 

Conv({O,(2,0),(2,2),(l,3),(0,2)}) and root set {(3,^^) , (4,2± v / 3)}. ■ 

5 Monomial Morse Functions and Connected Components: 
Proving Theorem [2| 

Let us begin with a refinement of a lemma due to KhovanskiQ Recall that a d-flat in R" is simply 
a translate of a d-dimensional subspace of W 1 . 

Lemma 10. Let a\, . . . , a m S M. n , c\, . . . , Cm 6l, and let Z be the zero set of YllLi Ci& ai ' z in M. n , 
where ■ denotes the usual Euclidean inner product of vectors in 1". Then there is an (n — I) -flat in 
M" which intersects at least half of the non-compact connected components of Z . Furthermore, the 
set of unit normal vectors of all such (n — 1) -flats is non-empty and open in the unit (n — 1) -sphere. 

For the con venienc e of the reader, we sketch a proof below, based on an argument of Jean- Jacques 



Risler from [Pis85j Sec. 2]. 

Proof: Let us call any function of the form described in the statement of the theorem an exponen- 
tial m-sum. Let Zi be any non-compact component of Z and d any connected unbounded curve 
(defined by a system of exponential m-sums) lying in Zi. Let pi be any limit point (as \\z\\ — > +00) 

of the set j z G C\ \ . That the set of such limit points is in fact finite follows easily from a 



slightly more general version of Khovanski's Theorem on Real Fewnomials [Kho91, Cor. 6, Pg. 80, 
Sec. 3.12], stated in terms of exponential sums. 

If H is any hyperplane (so O € H ) such that Pi^H for all i then one of the open unit hemispheres 
defined by H contains at least half of the points pi. In particular, note that such an H must clearly 
exist and any hyperplane H' with unit normal vector sufficiently near that of H will also define 
an open unit hemisphere containing at least half the pi. To conclude, note that any (n — l)-flat, 
parallel to H and far enough in the direction of the pole of the hemisphere containg the most pi, 
will intersect half of the Cj and thus half of the Z». So we are done. ■ 

We will also need an extension of the classical bounds on the number of connected components 
of a real algebraic set. 

Lemma 11. Given any fi-sparse k x n fewnomial system F , the number of connected components 
of Z+(F) is no more than 2™-^(2n + 1)^2^^ +1 )/ 2 . ■ 



The smooth case, which admits a sharper bound, is detailed in |Kho91, Sec. 3.14]. The special case 



of integr al exponents (allowing degeneracy) is nothing more than |Roj00a , Cor. 3.2] and the proof in 



[ RojOOa , Sec. 3.2] extends with no difficulty to real exponents. One can in fact genera lize th e above 



lemma to semi-Pfaffian sets, provided one loosens the stated upper bound somewhat | Zcl99 |. 

4 In |Ris85 , Risler outlines a proof of the first portion of Lemma hoj, in the special case where all the a,i lie in Z". 
He also cites a paper of Khovanski for further details. However, as far as the authors can tell, the wrong paper by 
Khovanski was cited. 
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A construction which will prove quite useful when we count components via critical points of 
maps is to find a monomial map which is a Morse function [ Mil63| relative to a given fewnomial 



zero set. Recall that an n-dimensional polyhedral cone in M. n is simply a set of the form 
{r\a\ + ■ • • + r n a n | n, . . . ,r n > 0}, where {ai, . . . , a n } is a generating set for 1". In particular, an 
n-dimensional cone in R n always has non-empty interior. 

Lemma 12. Suppose f is an n-variate m-nomial. Then there is an n-dimensional polyhedral cone 
KCR n such that a£ A"\{0} implies 

1. Every critical point of the restriction of x a to Z + (f) is non- degenerate. 

2. The level set in Z + (/) of any regular value of x a has dimension < n — 2 . 

3. No connected components of Z + {f) other than isolated points are contained in any level set of 
x a . 

4- At least half of the non-compact connected component of Z+(f) have unbounded values of x a . 

Proof: Set Z := Z + {f). Let us first show how assertion (3) can be attained: Since the number 
of components of Z is finite by Lemma [0], we can temporarily assume that Z consists of a single 
component. Then, if we could find n linearly independent a € M™ with Z C {x £ \ x a — c a } for 
some c a , Proposition]^ would immediately imply that Z is contained in a point. So condition (3) 
can be enforced. 

To ensure the truth of condition (4), note that we can perform the substitution (xi, ...,x n ) = 
(e Zl , . . . , e z " ) to reduce to finding an (n — l)-flat intersecting at least half of the non-compact compo- 
nents of the real zero set of an exponential m-sum. That such a hyperplane exists (and the fact that 
a small open neighborhood of such hyperplanes exist) then follows from Lemma |l^, so condition (4) 
holds. 

To enforce conditions (1) and (2) let us maintain our last change of variables. A simple derivative 
computation (noting that x i— ► (e Xl , . . . , e Xn ) defines a diffeomorphism from M™ to R") then shows 
that it suffices to instead prove the analogous statement where / is replaced by a real exponential 
sum (a real analytic function in any event) and x a is replaced b y the line ar form a\Z\ + • • • + a n z n . 



The latter analogue is then nothing more than an application of BCSS98 , Lemma 1, Pg. 304]. Since 
the number of components of Z is finite, we thus obtain assertions (1) and (2). 

Our lemma then follows by intersecting the four sets of a we have just determined, and noting 
(thanks to Lemma |l0|) that the intersection clearly contains a cone over a small (n — l)-simplex. ■ 
Note that if we omit condition (4) then we can make conditions (l)-(3) hold in an even larger set 
of a: all a in K™ outside a finite union of hyperplanes. 

Example 3. In general, one can not find an a with every component of Z + (f) giving unbounded 
values for x a . This follows from an elementary calculation with n = 2 and 

f(x, y):=(l- x- xy--) ( 1 - y - xy ] \ 



v J \ x J \ x y 

showing that, for all (01,02), x ai y a2 has unbounded values on no more than 2 of the 3 components 
of Z + (f). The authors thank Daniel Perrucci and Fernando Lopez Garcia for this example, o 

We will also need the following useful perturbation result, which can be derived via Sard's 



Theorem [Hir94] and a simple homotopy argument. (See, e.g., [Bas99, Lemma 2] for even stronger 



results of this form in the setting of integral exponents and zero sets in K™.) 

o 

Lemma 13. Let f be any n-variate m-nomial, Z+(f) the solution set of \ f\ <S in K™ , and Z+ (/) 

o 

the boundary of Z^_(f). Then for 5>0 sufficiently small, Z+ (/) is smooth and has at least as many 
compact (resp. non-compact) connected components as Z + {f). ■ 
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Example 4. If f(x,y):=x 2 + (1 — xy) 2 then note that Z + (f) is empty while, for any S>0, Z^_(f) 

contains the point (\/S, -^=j . So Z^_(f) need not have the exactly same number of compact (or 

non- compact) components as Z + (f), even if 5>0 is very small. The authors thank Daniel Perrucci 
for this example, o 

Example 5. Boundaries of tubes about analytic sets behave a bit differently in R 1 ? than in R™. 



For instance, unlike the analogous bound overW 1 (see, e.g., [RojOOa, Lemma 3.1]), the number of 

o 

components of Z + (f) can not be bounded above by half the number of components of Z^_ (/); taking 

o 

f(x.y) = Yii=i(y ~ i x )> it * s easily checked that Z + (f) has exactly D components, while Z+ (/) has 
exactly D + 1 components for 5>0 sufficiently small, o 

Proof of Theorem For simplicity, let us assume that all components are connected and lie in 
R™. 

Assertion (0) follows immediately from Proposition g, UGDRS, and noting that is an n-variate 
polynomial with exactly one non-compact connected component in its real zero set. 

To prove assertion (1), note that assertion (2) of Theorem || (which we'll soon prove below) and 
assertion (4) of Theorem || easily imply the first formula. Proposition || then tells us that the second 
equality can be proved simply by employing a monomial change of variables to reduce to the case 
of an (m — l)-variate m-nomial g. In particular, since m>3, every component of Z+(g) will still be 
non-compact. (This is clear from another application of Proposition ||, separating the cases where 
dim Newt (g) equals, or is strictly less than, m.) Moreover, dim Newt (5) = m => the number of 
non-compact components is exactly 1. So we can assume that dimNewt(<7) <m, use Proposition ^ 
one last time, and then intersect with an appropriate coordinate flat to derive the final inequality. 
So assertion (1) is proved and we can assume henceforth that n>2. 

To prove assertion (2), let us first construct a concrete family of examples realizing the lower 
bound: Consider the polynomials 

/ n \ /|m/2j-n-l \ n \_{m-l)/{2n)\ 

9l {x):= m^-l) 2 + J] (*i-*) 2 and 92(x):=Yl U ^ 0' • 

\i=2 / V i=l / 3=1 i=l 

Clearly, g\ and gi respectively have exactly 2[to/2J and 2n [_(m — l)/(2n)J monomial terms. From 
the basic fact that a 2 + b 2 = ==>• a = b = for all real (a, b) , it is easily checked that the numbers 
of roots of g\ and 32 in R+ are finite and in fact are identically the formulae embedded in our 
lower bound. More precisely, we immediately obtain our lower bound for a restricted class of 
m depending on the congruence class of m mod 2 or mod In. This restriction can easily 
be removed by adding additional monomial terms in such a way that the number of compact and 



non-compact components is not decreased. To do this, simply note that by Sard's Theorem [Hir94] 
(and the definition of an ?i-sphcrc), we have that Z + {gi — So) is smooth and has the same numbers of 
compact and non-compact components as Z + {gi) for all S$ >0 sufficiently small. Similarly, the same 
will then be true of Z + {x\(gi — 5q) — Si), for S\ >0 sufficiently small, and the latter polynomial has 
exactly 1 more monomial term than gi. Proceeding inductively, we can thus remove our restriction 
on m, and we thus obtain the lower bound of assertion (2). 

To prove the upper bound of assertion (2), note that we can divide by a suitable monomial so 
that / has a nonzero constant term. By Lemma |l3|, we then have that for S>0 sufficiently small, it 

o 

suffices to bound the number of compact components of Z+ (f) — an "envelope" of Z + (f ). Recall 

o 

that Lemma [l3| also grants us that Z^_ can be assumed to be smooth. 

By Proposition || and Lemma |l2|, we can then pick annxn matrix A so that, after we make the 

o 

change of variables x = y A , the number of compact and non-compact components of Z^_ is preserved, 

o 

no component of Z 6 , of positive dimension is contained in a hyper-plane parallel to the yi-coordinate 
hyperplane, and we can use critical points of the function y\ to count compact components. 
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Consider then the systems of equations G± :— (/ ± 5, y^d^f, ■ ■ ■ , y n d n f), where 9, = here. By 

o 

construction, every compact component of Z+ (/) results in at least two extrema of the function yi, 
i.e., i 3 comp (n, to) is bounded above by an integer no greater than half the total number of roots of 
G+ and G_. (In particular, if Z+(f) were smooth to begin with, then wc could have omitted the 

o 

use of (/) and G±, since P C omp(^j m ) would instead be bounded above by an integer no greater 
than half the number isolated roots of G := (/, y^d^f, ■ ■ ■ , y n d n f).) Note also that by assertion (1) 
of Lemma |l2|, all the roots of G± (and G) are non-degenerate. Furthermore, G± (and G) clearly 
has no more than to distinct exponent vectors, so the upper bound on P CO mp(n,m) holds. As for 
the number of compact components of Z + (p) , the preceding argument applies as well, so we need 
only observe that p±S and yidip are both of the form q(x Vl y Sl , x U2 y V2 ) where Newt (q) = Newt (p). 
So assertion (2) follows. 

To prove assertion (3), let us construct another family of explicit examples: Consider the poly- 
nomials 

m-l ra-l L(™-l)/(2«-2)J 

h\(x) := Y[ ( x i - i) an d h 2 {x) -=2_j Yl ( x i _ ^ 

i—l j—1 i—1 



Clearly, hi and hi respectively have exactly m and 2(n — l)[(m — l)/(2n — 2)J monomial terms. 
Note also that Z+(hx) and Z + (hi) have only non-compact connected components, and the numbers 
of such components are in fact the formulae embedded in our lower bound (for a restricted class of 
to) . The lower bound of assertion (3) then follow easily, mimicking the argument we used earlier to 
remove the congruence class restriction which arose during the proof of the lower bound of assertion 
(2). 

To prove the upper bounds of assertion (3), let us work directly with / and make an independent 
application of Lemma [l^. We can then apply Proposition ^ to make a change of variables x — y A 
(preserving the number of compact and non-compact components of Z+(/)), so that at least half of 
the non-compact components of Z + (f) have unbounded values of y\. So, for e>0 sufficiently small, 
the number of non-compact components of Z + (f) is no more than twice the number of components 
of the intersection Z' :=Z + (f) n{?/€:R™ | y% = So by substituting yi = \ into /, we obtain a new 
m-nomial hypersurface Z"CR n_1 with at least as many components as Z' . So Z" has at least half 
as many components as Z + (f) has non-compact components, and thus the number of non-compact 
components of Z + {f) is no more than 2P(n — l,m). So the upper bound on P non (n,m) is proved. 
As for the number of non-compact connected components of Z + (p), the preceding argument still 
applies. So we need only observe that, modulo a monomial change of variables via Pr opositio n 0, p 
can be assumed to be a polynomial of degree D. Lemma |l^ and Bezout's Theorem lSha77t ex. 1, 



pg. 198] (along with an exponential change of variables) then proves what is left of assertion (3). 
To prove assertion (4) simply note that the isolated points of vertical tangency of Z + (f) are 

exactly the isolated roots of the bivariate fewnomial system H := {j > x 2-§^j ■ When f(x) — 
p(x ai , . . . ,x am ) for some . . . ,S m ], a simple application of the chain rule then shows that 

x 2§^ — q{x ai , . . . , x am ) for some a%, . . . , a m € M 2 and some q £ R[Si, . . . , S m ] with Newt(g) C 
Newt(p). In particular, p = 1 + Si + ■ ■ ■ + S m q is a homogeneous linear form in Si, ... , S m , 
so the first part of assertion (4) follows. To prove the second part, note that (ri,si) and (142,1*2) 
linearly dependent ==> Z+ (p) is a union of no more than two binomial curves (via Proposition ^ and 
factoring over R), and such curves have no isolated points of vertical tangency in R 2 ^. So, assuming 

det ri " 2l 



Si v 2 



^0, Proposition U then tells us that it suffices to count the isolated roots (Si, S%)€: 1 



of the 2x2 polynomial system H. By Bernstein's Theorem [BKK76|, the number of complex isolated 
roots of the resulting system is at most Area(Newt(p)). So assertion (4) is proved. 

To prove assertion (5), first note that [m = l Z+(f) is empty] and [to = 2 ==>• Z + (f) has no 
isolated inflection points] . So we can assume to = 3 and that / has a constant term. Note then that 
by Lemmata |l| and ||, (xi,X2) is an inflection point of Z + (f ) f(x) = q(x Tl y Sl , x U2 y V2 ) — 0, where 
q G R[Si, S2]. In particular, Lemma ^| tells us that q is either a homogeneous cubic polynomial or 
a polynomial with Newton polytope contained in 3Newt(p), according as we focus on the first or 
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second part of assertion (5). Just as in the last paragraph, we can also assume that det 



n u 2 
si v 2 

2 



and thus reduce to counting the isolated roots in R^ of a 2 x 2 polynomial system in (Si, S2). For the 
first part of assertion (5), the fundamental theorem of algebra tells us that q splits completely over 
C[Si, S2], so we can further reduce to no more than three 2x2 linear systems and easily obtain our 
bound of 3/C'(2,m). For the second part, we can easily conclude by Bernstein's Theorem |BKK76| . 
So assertion (5) is proved. 

To prove the last observation of Theorem ||, note that by Proposition [l] and Lemma |l|, Z + (f) 
has a singularity =4* Newt(/) is a line segment, and then / must be the square of a binomial. So 
the case where Z + (f) is singular follows immediately. The case where Z + (f) is smooth then follows 
easily from assertions (4) and (5), since Theorem |] of Section || implies that /C'(2, 3) — K.(2, 3) = 1. 



6 Momenta, Polytopes, and the Proof of Theorem [3| 

Let S and Int(S) respectively denote the topological closure and topological interior of any set S, 
and let Rellnt(Q) denote the relative interior of any d-dimensional polytope Q C R™, i.e., Q\R 
where R is the union of all faces of Q of dimension strictly less than d (using as the only face 
of dimension < 0). We then have the following variant of the momentum map from symplectic 
geometry |Sma70| , |Sou70|| . 



Lemma 14. Given any n- dimensional convex compact polytope P C R n , there is a real analytic 
diffeomorphism i\)p : R™ — > Int(P). In particular, if f is an n-variate m-nomial with Newt(/) = P 
(so dim Newt (f) — n) and w GM. n \{0} , then ipp(Z + (f)) has a limit point inRe\lnt(P w ) =>■ lmt w (f) 
has a root in R™. Moreover, there is a real analytic diffeomorphism between Z + (Init w (f)) CR™ and 

(Re\lnt(P w ) n VpOM/))) x R ? ;- dimpl " . 

Proof: By ]Ful93| , Sec. 4.2, Lemma, Pg. 82], the map cp : R n — ► Int(P) defined by 



(x):= 




is a real analytic diffeomorphism. Composing coordinate-wise with the logarithm function, we then 
obtain that 

ipp{x):= ^2 px p / x q 

p a vertex of P I q a vertex of P 

yields our desired real analytic diffeomorphism from R™ to lnt(P). 

The remainder of the lemma follows easily via a monomial change of variables. In particular, 
the special case where P can be defined by a finite set of inequalities with rational coefficients is 
already embedded in the theory of toric varieties, e.g., Ful93| , Prop., Pg. 81]. The general case of 



arbitrary polytopes in R" can be proved as follows: Let d := dim P w , let v 1 , . . . , v n _d & Q n be any 
linearly independent normal vectors of P w , and let i;„_d+i, . . . ,v n <EQ n be any linearly independent 
vectors parallel to P w . Then, letting A be the inverse of the n x n matrix whose i— column is u,; 
for all . • • ,n}, we can clearly write f(y A )=g(yi, ■ ■ ■ ,y n ), where 



(yi,...,y n ):= V y^ 1 ■ ■ ■ y^ d d g a (y n - d+ i, . . . , y n ), 



( ,ai _ _ „, a »-<i, 
a:=(ai,...,a„_j) 



the sum ranges over {(vi ■ a, . . . , u n _d • a) | aeSupp(/)}, and for any such a there is an m a such 
that g a is a e?-variate m a -nomial. Most importantly, if 



f3=\ min {vi-a},..., min {u„„ d • a} 

v a£Supp(/) aeSupp(/) 
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then (yx, . . . , y n _ d ) l3 gp(y n - d+1 , . . . , y n ) = lnit w (f)(y A ). Clearly then, ip P (Z + (f)) has a limit point 
in RelInt(P"') =4> there is an M >0 such that Z + (f) intersects 

( f] (xeRl | x Vi >^-\ j n ( P| {xeRl \ x Vi <M} J n f| |xeR'; | x Vi = £i } 

for all £j > sufficiently small, since P w is compact and ij)p is a diffeomorphism. By Proposition |^, 
the map x i— » x" 4 is a difFeomorphism, so there is also an M'>0 such that Z + (g) intersects 

f] {yeR% \ yi <M'} ) n{ yi = Si \i€{l,...,n-d}} 





for all 5i >0 sufficiently small. By our formula relating g and 5/3 (and the fact that Z+(g) is locally 
closed, being the zero set of a continuous function), we then have that Z + {gp) intersects the last set 
as well. So Z+(gp), and thus 2T+(Init „,(/)), is non-empty. 

To conclude, a routine monomial change of variables shows that 

pa vertex of P w 

gives us our desired real analytic diffeomorphism. ■ 

Note that the converse of Lemma |lj need not hold: A simple counter-example is 

f(x, y) = (x 2 +y 2 - l) 2 + (Z - l) 2 and w = (0, 1). 

We also point out that the easiest way to understand the above lemma is to take any example / 
with Newton polytope identical (near the origin) to the nonnegative orthant, and then note that 
one is in essence compactifying R™ by adding coordinate subspaces, as well as some other pieces 
which are images of (R*) fe under monomial maps. Indeed, the monomial change of variables in 
our proof essentially results in an invertible affine map which sends a <i-dimensional face of P to a 
d-dimcnsional coordinate subspace of IR ra . 

Theorem ^ then follows easily from a refinement of the last lemma. 

Lemma 15. Following the notation of Lemma assume that Z + (Imt w (f)) is smooth for all 
weR n \{0}. Then 



1. For any facet Q of P, every connected component of Rellnt(Q) fl ipp(Z+(f)) is an (n — 2)- 
manifold which is the set of limit points in Rellnt(Q) of ipp(C) for some unique non-compact 
connected component C of Z + (f). 

2. C a non-compact connected component of Z + {f) ipp(C) has a limit point in Rellnt(Q) for 
some inner facet Q of P. 

Proof: To prove (1), first note that the last portion of Lemma [li] already tells us that every 
connected component of RelInt(Q)n , 0p(Z+(/)) is an (n— 2)-manifold, since Z + (lnit w (f)) is smooth 
for all w. (Indeed, the number of connected components of any Z + (Init u) (/)), and thus Rellnt(<5) n 
ipp(Z+(f)), is finite by Lemma O.) Furthermore, it is clear that every connected component of 
RelInt(Q)nV'p(.Z+(/)) must be the set of limit points of some collection of non-compact components 

of Z + (f). 

To see why a component of Rellnt(Q) n ipp(Z+(f)) can be the limit set of just one non-compact 
component of ipp(Z + (f)), we can specialize the monomial change of coordinates from the proof 
of our last lemma as follows: Let w be any nonzero inner facet normal vector of Q and let A 
be any invertible n x n matrix such that Aw is the first standard basis vector. Also let 5 be 
the minimum value of w ■ a as a ranges over Supp(/). Lemma |l4| and Proposition |^ then tell 
us that ipp(C) is diffcomorphic to some non-compact component of Z + (g) where g(yi, ■ ■ ■ , y n ) = 
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f(y A ) = V\9s{V2, ••■,Vn) + J2 a Vi9a{V2, ■ • ■ , Vn) , the sum ranges over {w ■ a > 6 a G A}, and 
Vi9s{y2, ■ ■ ■ iVn) — I n it lu (/)(y" 4 ). Dividing out by yf, we then obtain by Proposition |, the implicit 
function theorem [ Rud76|, Thm . 9.28, Pg. 224], and a simple induction on dimQ that every connected 



component of U fl ipp{Z + {f)) is a connected (n — l)-dimensional quasifold [PraOl BP02|, for some 
neighborhood U of Q in P. So assertion (1) is proved, with the additional strengthening that for all 
w G M™\{0}, every component of RelInt(P tu ) n tpp(Z+(lriit w '(f))) is the limit set of some unique 
non-compact component of Z + (Imt w >(f)), where P w is a face of dimension 1 + dimP 1 " and w'gl". 

To prove (2), note that ipp(C) must be a non-compact subset of P and a closed subset of 
Int(P). Since P is compact, ipp(C) must therefore be compact and contain a point in dP. So 
now let Q be the face of highest dim ension d such that tpp(C) intersects Rellnt(Q). By assertion 



(1) (and the definition of a quasifold [PraOl, Sec. 1]), dP H iftp(C) must be an (n — 2)-dimensional 
quasifold with only finitely many connected components. Lemma |l4j then tells us that d<n — 1 ==>■ 
dim(Q H ipp(C)) < d, since lnit w (f) is not identically zero. So if d< n — 1 we must then have that 
dPC)tpp(C)= (J Rellnt(Q') n ipp(C) has dimension strictly less than n — 2, thus contradicting 

Q' a face of P 

assertion (1). So d = n — 1 and assertion (2) is proved. I 

Note that the smoothness hypothesis of Lemma ^| in fact implies that every non-compact connected 
component of Z+(f) contains an [n — l)-dimensional manifold. Note also that the smoothness 
hypothesis (at least for w that are inner facet normals) is necessary for assertion (1). 

Example 6. Consider f(x, y) := (x + y — l)(y — x + 1). Then Z+(f) consists of a exactly 2 disjoint 
rays and Init(Q,i)(/) = — (x — l) 2 has a degenerate root at 1. In particular, (1,0) is a limit point of 
both the rays of this Z + {f). o 

Proof of Theorem |^: Let Z := Z + (f). By Lemma |l5|, the number of non-compact connected 
components of Z is no more than J2 W where the sum ranges over all unit inner facet normals 
of P = Ncwt(/) and N' w is the number of connected components of RelInt(P™) n tpp(Z+(f)). By 
Lemma [l4[ ^L—J2 W N w > so the first part of theorem ||| is proved. 

The second assertion is then a trivial consequence of the first via the definition of P(n, m). 

To conclude, assertion (0) of Theorem || easily implies that for n < 2 our penultimate bound 
specializes to exactly the number of points of Supp(/) on the boundary of Newt(/), regardless of 
whether Z is smooth or not. To halve this bound, simply note that for smooth Z, every non-compact 
component C of Z is homeomorphic to an open interval. Therefore, by Lemma [l5], \pp{C) must 
intersect the boundary of P exactly twice. So we are done. H 

Remark 7. Bertrand Haas has pointed out that the very last assertion of Theorem [3| ( concerning 
non-compact connected components of m-nomial curves in ), in the case of integral exponents, 
follows easily from work of Isaac Newton published in 1744 ljNew44 , Book I, Chap. 3]. The relevant 



result of Newton relates Puiseux seri es and diagrams involving the portion of Newt(/) visible from 



the origin, and can also be found in , Coo5i , Chap. II, Paragr. 1, Pg. 213]. 
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Konstantin Alexandrovich Sevast'yanov was born on January 21, 1956 
in Astrakhan (an old Russian town on the Volga river) and graduated from 
a very famous mathematical high school organized by Andrey Nikolaevich 
Kolmogorov. At the age of 17 he was a winner of the International Math- 
ematical Olympiad for high school students and thus skipped his entrance 
exams to become a mathematics student at Moscow State University. His 
supervisor was Anatoly Georievich Kushnircnko, and Vladimir Igorevich 
Arnold and Askold Georgevich Khovanski also supervised Sevast'yanov's 
research. Sevast'yanov was a gifted student but suffered from poor health 
throughout his life. He formulated, around 1979, the key result that inspired 
Khovanski to create Fewnomial Theory. Sevast'yanov eventually went on 
permanent leave as his illness worsened and on December 7, 1984 he was 

killed after apparently being struck by a car. 

Konstantin Alexandra™ wast yaw, around 198o 

Those who bless us with beautiful results should never be forgotten, even if tragedy obscures their 
accomplishments. We therefore dedicate this paper to the memory of Konstantin Alexandrovich 
Sevast'yanov. 
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